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Quantum theory violates Bell's inequality, but not to the maximum extent that is logically possi- 
ble. We derive inequalities (generalizations of Cirel'son's inequality) that quantify the upper bound 
of the violation, both for the standard formalism and the formalism of generalized observables 
(POVMs). These inequalities are quantum analogues of Bell inequalities, and they can be used to 
test the quantum version of locality. We discuss the nature of this kind of locality. We also go into 
the relation of our results to an argument by Popescu and Rohrlich (Found. Phys. 24, 379 (1994)) 
that there is no general connection between the existence of Cirel'son's bound and locality. 
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I. INTRODUCTION 

The violation of Bell's inequality by the predictions 
of quantum theory (both in its non-relativistic and rela- 
tivistic versions) shows that quantum theory is non-local 
in the sense that its results cannot be reproduced by a 
hidden- variables theory in which measurement results de- 
pend only on the local settings of the measuring devices 
and on the properties of the objects being measured (a 
local hidden- variables theory). However, the maximum 
violation of Bell's inequality allowed by quantum theory 
is less than the maximum violation that is logically pos- 
sible: quantum theory obeys Cirel'son's inequality. One 
might surmise that this is due to the fact that quan- 
tum theory does not abandon locality completely: after 
all, in situations of the Einstein-Podolsky-Rosen (EPR) 
type the measurements performed on one wing do not 
influence expectation values on the other wing (the no- 
signaling theorem; in the relativistic context this is the 
feature of relativistic causality). Perhaps compliance 
with a no-signaling demand restricts the extent to which 
Bell's inequality can be violated, and perhaps inequali- 
ties like Cirel'son's can be regarded as a touchstone of 
this kind of locality (in the same way as Bell's inequality 
is a touchstone for locality in the classical sense). 

In this paper we show that this hypothesis is right: 
the fact that quantum theory docs not violate Bell's in- 
equality to the maximum logically possible extent is due 
to features of locality that are built into the theory. We 
derive a set of inequalities, and a strongest inequality rep- 
resenting this whole set, that can be regarded as quan- 
tum versions of Bell's inequality. To make the analogy 
with Bell inequalities clear we will analyze how locality is 
implemented in quantum theory, and in what sense the 
quantum theoretical inequalities we derive are based on 
locality assumptions. We will discuss how this relates to 
a result of Popescu and Rohrlich [p| that at first sight 



seems to show that the existence of Cirel'son's bound is 
unconnected with locality issues. 



II. CIREL'SON'S INEQUALITY 

Consider a probability space in which there are four 
stochastic functions. A, a, B, h, each of which can take 
the values +1 or — 1 . The quantity AB + Ab + aB — ab = 
A{B -I- 6) -f- a{B — b) can only be +2 or —2, from which 
it follows that the absolute value of its expectation value 
is smaller than 2: 

\(AB + Ab + aB-ab)\ < {\AB + Ab + aB~ab\)=2. (1) 

This is the form of Bell's inequality that we will con- 
sider. The inequality is respected by physical quantities 
in classical theories, as long as these quantities can be 
represented by (stochastic) functions on one state space, 
with a joint probability distribution — which is ordinarily 
the case. We will discuss the connection with locality in 
sect. 0. 

In quantum mechanics physical magnitudes are not 
represented by stochastic functions on a phase space, but 
by Hermitian operators on a Hilbert space. Let us now 
use A, a, B, b to denote such operators that have eigen- 
values -f 1 and —1, and let us consider a combination of 
them that is analogous to the combination of quantities 
in Bell's inequality: AB + Ab + aB — ab, where we as- 
sume that the operators occurring in a product commute. 
As was first shown by Cirel'son the modulus of the 
quantum mechanical expectation value of this expression 
is bounded by 2^2: | {AB + Ab + aB - ab)\ < 2^2— the 
upper bound can be attained, as shown by the example 
of the singlet state. So Bell's inequality can be violated 
by quantum theory; but the quantum expectation value 
stays well below the logically possible upper bound of the 
expression \{AB + Ab + aB — ab)\, namely 4. 

Cirel'son's inequality can be proved elegantly by ob- 
serving j|] that if ^2 = = = 6^ = 1 and 
[A, B] = [A, b] ^ [a, B] ^ [a, b] ^ 0, then 



'Electronic address: dieks@phys.uu.nj 



(AB + Ab + aB - abf =41- [A,a][B,b]. 
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It follows from this that 

(C)2<(C2)<||C|p<4 + 4||^||||a||||i?||||&||=8, 

or 

\{C)\<2V2. 

An alternative simple proof, which is analogous to the 
above proof of Bell's inequality (|^) and similar to proofs 
of other inequalities that we will give in sections III and 
1^, goes as follows. 

For a normed state vector |?/;), put A\il}) = \A), iJjV') = 
a 1-0) = I a) and h\'ip) = \h). Each of these four vectors 
has a norm that is < 1. We now have 

i(c^>i = mem 

= \{A\B + b) + {a\B -b)\ 

< \\\ B) + \b)\\ + \\\B )-\b)\\ 

< ^/2{l + Re(S|6)) + ^2(1 - Re(B|6)) 



< 2V2. 



(2) 



The difference between this derivation and the deriva- 
tion of Bell's inequality is that for numbers B and b with 
norm < 1 we have |_B+6| + |-B — fo| < 2, whereas for vectors 
with norm < 1 we find \\\B) + |fe)|| + - |&)|| < 2^/2. 
In the latter case the maximum is attained when \B) and 
\b) are perpendicular. 

In derivation (|l^ the essential premise is that the op- 
erators A, a commute with _B, b. At first sight, derivation 
does not make use of this premise. This impression is 
deceptive, however. The operator products occurring in 
C are hermitian operators (and therefore representations 
of physical quantities) if and only if the operators that are 
multiplied commute, and this leads to exactly the same 
commutativity requirement as in Q). One physical con- 
sequence of this commutativity requirement is that the 
operators A and a are jointly measurable with the opera- 
tors B and b. Moreover, it follows from the commutativ- 
ity that it does not make any difference for the expecta- 
tion values of the operators B or 6 whether they are mea- 
sured together with A or a (the no-signaling theorem). 
Within the framework of the orthodox measurement for- 
malism co-measurability and causal independence (in the 
sense of no signaling) therefore go together: they both 
hold if and only if the commutativity requirement is sat- 
isfied. In this case Cirel'son's inequahty also holds. 



III. GENERALIZED MEASUREMENTS 

Above we followed the orthodox point of view about 
the mathematical representation of physical quantities in 
quantum theory, namely that physical quantities are rep- 
resented by hermitian operators. Within this framework 
joint measurability is equivalent to commutativity (which 
in turn leads to the no-signaling theorem in the context 
of the EPR experiment). But there is a more general 



treatment of measurements in quantum theory, first de- 
veloped by Ludwig Q and Davies 1^, in which physical 
quantities correspond not to single operators but to col- 
lections of positive operators Mi on the Hilbert space, 
such that 



M, 



>0, ^Af, 



1. 



If the possible outcomes of a measurement of the consid- 
ered quantity are m^, the probabilities of obtaining these 
values in a state p of the system are given by TrpAf^. The 
mapping mi Mi is a positive operator valued mapping 
(POVM), representative of the associated physical quan- 
tity M. 

Two physical quantities A and B, represented by sets 
of positive operators {Ai} and {Bj}, respectively, are 
jointly measurable if there is a third quantity O, repre- 
sented by {Ok}, such that 



A, 



B, 



(3) 



keKi 



keK'. 



where {Ki} and {K'^} are two partitions of the index set 
through which k runs. 

If there is an O satisfying Eq.(|) we can measure it, 
and infer information about the outcomes and their prob- 
abilities of both A and B by grouping together the results 
according to the two partitions. An important feature of 
this formalism is that commutativity of the two general- 
ized observables A and B (in the sense that AiBj = BjAi 
for all is a sufficient but not a necessary condition 
for their joint measurability. // A and B commute, the 
products AiBj are positive operators characterizing the 
joint measurement of A and B. But in general a joint 
measurement need not correspond to product operators 
(see for a critical analysis of the significance of these re- 
sults §). 

So in the EPR situation we could imagine a joint mea- 
surement of two non-commuting generalized observables 
A and B, each pertaining to a different wing of the ex- 
periment. In this case it would no longer be true that 
the mere requirement of compatibility leads to causal in- 
dependence (no-signaling), the product form of the joint 
measurement operators, and the validity of Cirel'son's 
inequality. 

However, Busch and Singh have shown for the EPR 
situation, treated by means of the POVMs formalism, 
that if the possible values and their probabilities of the 
quantity measured at one wing are required to be inde- 
pendent of which quantity is measured at the other side, 
the operators representing the generalized observables at 
one wing must commute with those at the other. It fol- 
lows that in this case the operators corresponding to the 
joint measurement take on the product form again. So 
within the generalized measurements framework commu- 
tativity and product form are consequences of locality, in 
the sense of the impossibility of signaling. 

If the measurements in the EPR experiment are repre- 
sented by generalized observables, and if locality in the 
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sense of impossibility of signaling is assumed, Cirel'son's 
inequality can again be derived. To see this, consider 
one pair of the four pairs of observables, A and B, say. 
Because of the no-signaling requirement, the correspond- 
ing positive operators Ai and Bj commute, and the joint 
measurement of A and B can be represented by four pos- 
itive operators AiBj, with i,j = ±1. The expectation 
value of the outcomes of this joint measurement, in the 
pure state becomes: 

= (V-KAi - - B^,m = {A\B), (4) 

where \A) = {Ai - A^i)\ij) and \B) = [Bi - 
So for the purpose of calculating expectation values the 
generalized observables A, B, a, b can each be represented 
by a single hermitian operator, namely {Ai — A^i), 
(Bi — B-i), (ai — a_i) and {bi — respectively; the 
joint measurements are represented by the corresponding 
products. Compared to the case discussed in sect. ||, the 
differences are that the operators Ai, Bj, . . . need not be 
projection operators, and the squares of {Ai—A-i), {Bi — 
B-i), . . . need not be 1. The second proof of the Cirel'son 
inequality given in section || goes nevertheless through, 
because the operators {Ai — A^i), [Bi — B^i), ... all have 
norms < 1. 
Indeed, 



\\{A,-A^,)m' 
= \\A,m? + \\A. 



l|^)|P-2(AiV|A_i^), (5) 



whereas 



11(^1 +^-i)l^>lP 

= \\A,m\^ + \\A^,\m^ 



2(AiV'|A_iV) = 1,(6) 



so that 



||(Ai-A_i)|V')|p-l-4(AiV|A_iV). 



(7) 



Because A_i = 1 — Ai, [Ai,A^i\ = and the inner 
products in (||), (|6|) and (^ are real. This inner product 
is also > 0: 



(AiV|A_i^) = (^|Ai|V)-(V'|A2|V), 



(8) 



which is > because Ai has norm < 1 and only has 
eigenvalues with < Ai < 1. 

Now introduce vectors |a), |6) in the obvious way: \a) = 
(ai — a-i)\^p),\b) = (&i — 6_i)|'0). It follows from the 
above that the vectors \ A) ,\B) ,\a) ,\b) all have norms < 
1, just as the vectors denoted by the same symbols in 
sect. ||. Repeating the proof (||), we therefore find: 

\{AB + Ab + hB - hb)\ 

^\{A\B + b) + {a\B-b)\<2V2. (9) 

This inequality holds in every pure state \'ip). Its validity 
in any mixed state p follows immediately. 
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FIG. 1: The X,Y plane. The slanted square represents in- 
equalities ( |ic| ) plus (|l^), the circle inequality dis" 



IV. THE STRONGEST INEQUALITY 

Cirel'son's inequality is not the only nor the strongest 
one that can be derived from the locality (no-signaling) 
and therefore commutativity requirement. Put X = 
{Ab + aB) and Y = {AB — ab). Cirel'son's inequality 
can now be written as 



\X + Y\< 2V2. 



(10) 



In the X, Y 'correlation plane' this inequality restricts 
the points {X, Y) to the strip between the two lines 



X + Y = ±2V2. 



(11) 



But by a minimal change in the proof of sect. ^ it imme- 
diately follows that also the following inequality holds: 



\X-Y\< 2V2, 



(12) 



so that the points must also lie in the strip bounded by 
the lines 



X -Y = ±2V2. 



(13) 



We also have the obvious inequalities |X|<2, |1^|<2, 
so that the allowed points {X, Y) must be in the inter- 
section of the interiors of the two squares indicated in 
Fig. 1. It further turns out that they must be inside 
(or on the sides of) all squares that result from these 
just-mentioned squares by applying an arbitrary rota- 
tion around an axis through the origin of the X, Y plane 
and normal to this plane. To prove this, consider the 
expression |Xsin(/j -I- y cos(/?|. We have: 



\X sin ip + Y cos Lp\ 
— 1(^1 S cos -f- 6 sin (^) 4 
< II |i?cos(p) -f |6sinc/j)|| 



{a\B sin Lp — b cos | 
- II \Bsu\Lp) — \bcosip)\\ 



< \/sin^ + cos^ + 2Re(i?|6) siniy9COS(y5 



sin^ p + cos^ — 2Re(_B|&) shupcoaip 



< J sin^ ip + cos^ p + y sin^ p + cos^ p = 2. (14) 
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Cirel'son's inequality and the other inequalities men- 
tioned earlier in this section are special cases of this gen- 
eral set of inequalities (in which ip can take arbitrary 
values). It should be noted that these proofs apply both 
to the case of ordinary observables and to the case of 
generalized observables. 

It is clear from the geometry of Fig. 1 that the require- 
ment that all the inequalities (^4|) be satisfied leads to 
the inequality 



X^ + Y^ = {Ab + ~aBf + {AB - hbf < 4. 



(15) 



All points X, Y are inside or on the circumference of a 
circle with radius 2. 

Inequality (^5|) (which was recently proved directly, 
by a variational argument, for the case of ordinary spin 
observables by Ufhnk [B l) summarizes all generalized 
Cirel'son inequalities (|lj). All values X, Y that satisfy 
( p^ ) also satisfy all Cirel'son inequalities (111); but satis- 



faction of a finite number of inequalities of (14) is not 
sufficient to guarantee satisfaction of (|l5|). Moreover, 
each point on the circumference of the circle can actu- 
ally be attained, because the bound of the corresponding 
generalized Cirel'son inequality can be attained (the one 
resulting in a line tangent to the circle in the point in 
question). Inequality dl3) is therefore the strongest in- 
equality in terms of X, Y that follows from the require- 
ment of commutativity. 



V. LOCALITY 

Bell's inequality (|^) is valid for an arbitrary quadruple 
of stochastic functions on one probability space, and as 
such is not immediately connected with locality issues. 
The link with locality comes in via the application of 
(|l|) to situations of the Einstein-Podolsky-Rosen type in 
which A and a, and B and b, stand for measurements on 
the space- like separated wings 1 and 2 of the experiment, 
respectively. An experimenter at 1 can choose between 
measuring A and a; her or his colleague at 2 has the 
choice between B and b. The combined measurement on 
parts 1 and 2 is represented by the product of the indi- 
vidual single system result functions. This is justified by 
a locality assumption: a measurement of a physical quan- 
tity on one wing of the experiment has no influence at 
the other wing. On the basis of this assumption one- wing 
quantities are represented by one and the same function, 
regardless of whether, and if so which, measurement is 
performed at the other side. Both the possible measure- 
ment outcomes and their probabilities are insensitive to 
choices made at the other side. 

An obvious silent assumption in this is that A, a, B, b 
correspond to characteristic measurement devices and in- 
teractions. The device corresponding to A, e.g., should 
remain the same in different instances — that the possi- 
ble outcomes and corresponding probabilities remain the 
same is by itself not enough. Consider to make this clear 



a Stern-Gerlach device at wing 1 that undergoes a rota- 
tion depending on the choice between B and b: although 
the possible outcomes would still be -1-1 and —1 and the 
probabilities would remain equal to 1/2 (if the device 
measures the spin of a spin-1/2 particle), this would not 
constitute one specific measured quantity. Spin along 
different axes would be measured. In spite of the fact 
that all measurement results could be represented by the 
same function A^ the rotation of the corresponding device 
would signal non-locality. Within the quantum formal- 
ism such a non-invariance of the measuring procedure 
could easily lead to a violation of (|T^). An explicit ex- 
ample can be constructed by stipulating that the con- 
crete physical implementation of measuring AB and the 
other joint quantities in (|l5|), in the two-particle singlet 
state, be: "Measure cr^ and — cr^^, call the results 'spin 
of particle / along axis a and spin of particle // along 
axis /?, respectively'; perform the same measurement of 
CT^ and —(7^^ to obtain the spin values for the pairs of 
axes a',/3 and a,/?', and measure and +(7^^ in the 
case of a' , f3'^\ Obviously, the correlation functions ob- 
tained in this way violate Cirel'son's inequality maxi- 
mally (even though outcomes and probabilities at each 
wing are insensitive to what choice is made at the other 
wing). This is because the quantities defined in this way 
are not bona-fide local physical quantities in the sense we 
have discussed. Indeed, according to this measurement 
protocol the measuring procedure for the spin of particle 
// along (3' depends on whether spin along a or along a' 
is measured on particle /. 

So we have identified a first locality requirement: the 
operators that are used to represent physical quantities 
on the individual wings of the experiment should refer 
to the same physical devices and interactions, regardless 
of what goes on at the other side. It should be possible 
to measure these quantities on wing 1 and wing 2, re- 
spectively, together; therefore the operators representing 
them should be compatible (i.e., commuting or jointly 
measurable in the sense of the POVM scheme). 

The second locality assumption to be considered is that 
no signals are transmitted: the possible outcomes and 
their probabilities on the two sides of the experiment are 
insensitive to what happens at the other wing. Usually, 
this is the only assumption that is explicitly discussed. 

Within the orthodox treatment of measurements in 
quantum theory the compatibility assumption and the 
no-signaling assumption are equivalent: both lead to the 
requirement that the operators at one side commute with 
those at the other side. This commutativity is in turn suf- 
ficient to derive Cirel'son's inequality and its generaliza- 
tions, including inequality (|l^). Within the framework of 
generalized measurements it is the no-signaling require- 
ment that leads to commutativity; so here we have to 
invoke locality (in the sense of no signaling, or relativis- 
tic causality in the relativistic context) explicitly in order 
to derive (pq); the requirement of joint measurability is 
not enough. 

The two just-mentioned locality requirements together 
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lead to a theoretical description with one-wing operators 
(referring to invariant measuring procedures) that com- 
mute with those at the other win g. T his is sufficient for 
deriving the inequalities of section IV. These inequalities 
can therefore be used in experimental tests of locality. If 
an inequality is violated in experiments, this indicates 
either the propagation of influences that change the out- 
comes and/or their probabilities (an application of this 
idea can be found in |^), or it indicates non-invariance 
of the measured quantity (see above for an example of 
the latter possibihty).pl| 



VI. AN ARGUMENT BY POPESCU AND 
ROHRLICH 

The conclusion of the previous Section is that quan- 
tum mechanical locality (no-signaling) is responsible for 
the existence of the upper bound of Cirel'son's inequal- 
ity. This conclusion might seem in conflict with an argu- 
ment by Popescu and Rohrlich These authors argue 
that the impossibility of signaling does not limit the sum 
of the correlations occurring in Cirel'son's inequality to 
2^/2. Their counterexample is an EPR situation in which 
spin measurements are performed on the two wings. For 
the outcomes of these measurements a particular prob- 
ability distribution is postulated, as follows. The two 
possible outcomes are taken to be -1-1 and —1 along any 
axis, and both of these possibilities are postulated to have 
a probability of 1/2, independently of the measurement 
performed at the other wing. So the measured outcomes 
and their probabilities do not give any information about 
choices made at the other side; it is impossible to signal, 
or, as Popescu and Rohrlich put it, relativistic causality 
is satisfied. 

Further, for any pair of axes, the combinations of out- 
comes -|- 1,-1-1 and —1,-1 are assumed to be equally 
probable, and the same applies to the combinations 
+1,-1 and — 1,+1. Finally, the correlation function 
(a 'superquantum' correlation function) is stipulated to 
have a form like the following: 



E{e) 



+1 for < 61 < 7r/4 

2 - AO/tt for 7r/4 < 6* < 37r/4 
-1 for 37r/4 < 6* < tt 



(16) 



This is equivalent to assuming that the probability 
p++{0) of the pair of outcomes -1-1, —1 is given by 



P++{0) 



E{e) -f 1 



In these formulas 9 is the angle between the axes on the 
left and right, respectively, along which the spin mea- 
surements are made. 

Now consider four axes a', (3, a, (3' separated by suc- 
cessive angles of 7r/4 and lying in one plane. We find 
that 



Cirel'son's inequality can therefore be violated, even to 
the maximum extent logically possible, by a correlation 
function that respects invariance of one-wing outcomes 
and probabilities, and thus the impossibility of signaling. 

Clearly, therefore, the requirement that outcomes and 
probabilities are invariant is by itself insufficient to de- 
rive Cirel'son's inequality and its generalizations. Nev- 
ertheless, we have demonstrated above that Cirel'son's 
inequality can be derived from that locality assumption 
within the theoretical framework of quantum mechan- 
ics. It is only when the no-signaling requirement is im- 
plemented within a well-defined theoretical framework, 
equipped with prescriptions for how to represent observ- 
ables (corresponding to characteristic measuring proce- 
dures), that it gets enough bite to make the derivation 
of Bell-type inequalities possible. Indeed, we have seen 
that within the framework of theories that operate with a 
phase space on which physical quantities are represented 
by (stochastic) functions, the original Bell inequalities 
can be obtained, whereas within the Hilbert space formal- 
ism of quantum theory inequality (^) results. In both 
cases the correlation function postulated by Popescu and 
Rohrlich cannot arise in a local way (it can be produced 
by non-local means as was illustrated earlier in this sec- 
tion). 

The fact that within the framework of quantum the- 
ory the correlation function ( p^ cannot be produced in 
a local way (because it violates (|l5|)) does not mean, of 
course, that there cannot exist other theoretical frame- 
works in which 'superquantum' correlation functions 
could arise in a local way; frameworks that use neither 
functions on a state space nor the Hilbert space operator 
formalism. It is difficult to say anything definite about 
such hypothetical theoretical frameworks. Popescu's and 
Rohrlich's argument does not use any assumption about 
how the observables are represented mathematically, and 
therefore does not provide a sufficient basis for a discus- 
sion of locality and causality (in fact, we already observed 
that their probability distribution could be produced by 
non-local mechanisms, which implies that invariance of 
outcomes and probabilities is a necessary but not a suf- 
ficient condition for causality). 

Summing up, Popescu and Rohrlich are right in their 
claim that Bell-type inequalities do not follow from the 
no-signaling requirement alone. But this does not answer 
the question of whether locality, in the sense of the no- 
signaling requirement, may lead to such inequalities in 
the context of specific theories. It turns out that if locality 
is fleshed out within classical theories, this leads to Bell's 
inequality; if it is fleshed out within the framework of 
quantum theory this leads to inequality (p^). 
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